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Measuring the Ability Infer 


RALPH TYLER 
necessity having objective evidence the 


degree which students are attaining each the 
important objectives education stressed edi- 
torial this issue the This need has been 
recognized several departments Ohio State University. 
Instructors these departments co-operation with the writer 
are attacking the problem developing methods for measuring 
the students’ attainment these objectives which are not usu- 
ally tested the ordinary examination. The work the 
Department Zodlogy may serve illustrate the general 
program now under way several other departments. 
result considerable deliberation, the instructors 
zodlogy have set the following goals toward which students 
should progress elementary courses: 


fund information about animal activities and structures 

understanding technical terminology 

ability draw inferences from facts, that is, propose 
hypotheses 

ability propose ways testing hypotheses 

ability apply principles concrete situations 

Accuracy observation 

Skill use the microscope and other essential tools 

ability express effectively ideas relating zodlogy* 


There were already available techniques which are appar- 
ently valid for measuring the degree which students have 
attained Objectives and test for microscope skill 
described elsewhere the while ways measuring 


formulation these objectives and the work the measurement has largely 
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described method measuring the ability student 
draw inferences from the facts which have been presented. 

drawing inferences from facts meant the ability 
propose reasonable inferences which serve interpret the 
related facts obtained from observation, reading, 
listening. essential any such test that the student 
presented with the necessary related facts which asked 
interpret. Furthermore, order that this may not measure 
memory, necessary that these related facts new the 
student. previously has heard these facts interpreted 
has read interpretation, his response likely represent 
his memory the interpretation made someone else, which, 
course, would not give measure his own ability draw 
inferences from facts. 


elementary class, the most productive source new 
related zodlogical facts the current research work this 
field. brief description recent experiment and its results 
presents material which typical the data which the student 
expected learn interpret. Since the facts are new, suc- 
cessful performance not matter mere memory. 
series such descriptions was prepared for use test items. 
illustration one the simpler items is: 

autumn, when the pond cold, frogs hibernate burrowing into the 


bottom the pond. the spring frog put into tank water goes the 
bottom and attempts burrow into the bottom the tank. 


After formulating series such items varying 
culty, the next problem was the method making the test 
objective. The ability propose his own inferences without 
suggestion was desired for the student. was hoped, how- 
ever, that this ability might highly correlated with the 
ability select the best inference from series alternatives, 
which could easily measured objectively 
response test. learn whether the two abilities were highly 
correlated group sixty-six students was given two tests 
the same day. The first test consisted series items 
preceded the following directions: 

Suppose each the exercises which follow that the facts given were 
observed known true. Write the space below the facts, the one 


inference which seems you most reasonable attempting interpret the 
facts. The sample illustrates this procedure. 
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SAMPLE 

when treated with iodine solution gives blue color. When 
saliva mixed with starch and left for time the mixture longer turns 
jodine solution blue. 

produces change starch. 


Remember write down the inference which seems most reasonable from 
given. 


These papers were then taken and the students were 
given the same items multiple-response form. The follow- 
ing directions were used this second form: 


each the exercises below several facts are given. After the facts are 
suggested number inferences. Supposing these facts are true, select the 
inference which seems you most reasonable and place check mark front 

SAMPLE 

Starch when treated with iodine solution gives blue color. When saliva 
mixed with the starch and left for time the mixture longer turns iodine 
solution blue. 

Saliva turns starch sugar 

Saliva turns iodine blue 

Vc. Saliva produces change starch 
The color saliva destroys the blue color 
Saliva has effect grape sugar 


Remember check the inference which seems most reason- 
able from the facts given. 


test items were the same both tests, but the differ- 
ence was the ability required the student. the 
first test, had propose his own inferences; the second, 
had select the most reasonable inference from list 
alternatives. correlation between the results these two 
tests shows whether the multiple-response test, which much 
more easily scored, could used place the first test 
measure the ability propose inferences. The method 
scoring the first test will described presently, showing that its 
score highly reliable. The multiple-response test was scored 
the usual manner. the split-half method its reliability 
was found However, when the results the first 
test were correlated with the results the second, the coefficient 
obtained was only .38. This indicates that there some rela- 
tionship between the tests, but means high enough for 
the multiple-response test used place the first test. 
The ability select the most reasonable inference from 
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given list not the same the ability propose original 
inference. 

Since the objective multiple-response test could not used 
measure the ability propose inferences, became necessary 
develop reliable method scoring the original inferences 
proposed the students. this end, several devices were 
tried. different inference proposed was typed upon 
separate 3-inch 5-inch card. names were placed the 
cards order that the instructor might not influenced. 
set these cards was given instructor zodlogy who 
was asked take each test item turn and sort the infer- 
ences into many different levels quality could iden- 
tify, placing the best pile the right and the poorest the left. 
record was made the instructor’s sorting, the cards were 
then shuffled and given each the other instructors turn. 
The number different levels quality recorded varied from 
four the case one instructor eleven the case another. 

given instructor’s judgments were given numerical scores 
assuming that all the responses, accurately measured 
scale merit, would normally distributed. this means 
value for each judgment can obtained terms the 
standard deviation this distribution, using the formula: 


Y,—Y.2 
Or pa F 
When 
the mean value given 
left ordinate the normal 
curve under consideration, 
and 


=the fractional part all the cards 
which were placed given pile. 


The values and may quickly obtained from published 
tables the normal curve. 

After the scores had been calculated for each student the 
basis each instructor’s judgment separately, the objectivity 
the method was determined. was found, the ratings made 
any three instructors chosen random were used and each 
student’s score obtained averaging the scores given the 
three instructors, that none these scores differ more than 
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from the scores obtained using the ratings all the ele- 
mentary instructors the department. The ratings three 
instructors, then, can used represent the combined judg- 
ment the entire department the quality the original 
inferences which have been proposed the students. 


device for evaluating these inferences, which 
much simpler use, was tried. Each instructor was given 
the set cards and asked place them five piles according 
merit, No. representing the best inferences and No. the 
poorest, attempting place the cards that the average 
quality pile No. was much better than the average 
No. No. was better than No. No. was better than 
No. and No. was better than No. This the method 
equal-appearing intervals. scoring, inference placed 
pile No. was given five points; one placed No. was given 
four points, and on. The average scores obtained this 
method were then correlated with the average scores obtained 
the method previously described, giving coefficient .98. 
The simpler method equal-appearing intervals, therefore, 
can used place the method. 

final check objectivity was made this Autumn Quarter. 
Several new instructors had been added the department, none 
whom had seen the set inferences had any idea 
their placement the instructors last year. Three these 
instructors were given the shuffled cards, and were asked 
place them the method equal-appearing intervals. The 
average scores given these three new instructors were corre- 
lated with the average scores given the instructors the year 
before. The resulting correlation coefficient .96 indicates 
that the scoring three zodlogy instructors not likely 
vary from year year nor depend upon the particular per- 
sonnel the staff, and that the merit inference not the 
purely subjective quality often claimed. 

addition making certain the objectivity the scor- 
ing, necessary know whether the number test items 
adequate for the determination the student’s ability the 
interpretation facts. Since the items used represent 
only sample all the facts which might conceivably 
expected interpret, the reliability the test depends upon the 
adequacy this sample, and was determined the split-half 
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method and the use the Spearman-Brown formula. The 
reliability coefficient thus obtained was .95. thus appears that 
the test both reliable and objective. 

Several persons who have constructed tests have maintained 
that all the important objectives were closely correlated 
with information that information test was all that was neces- 
sary measuring achievement. This not true 
The correlation between test covering the important 
cal information and this test proposing inferences was found 
only .29, entirely too low justify the use the infor- 
mation test alone measure both objectives. the ability 
must have measure for it, for cannot depend upon 
information tests indicate the student’s attainment this 
objective. 

final problem now under investigation. very 
necessary that this method measurement does not become 
test memory, hence the items must always new the stu- 
dents. This only possible selecting new items from current 
research for each new test. new items are continually used, 
there danger that they will differing difficulty from 
those used previously, resulting the condition that evi- 
dence student growth this function will obtained. 
are now trying formulate method which the difficulty 
new item can predicted accurately advance order 
that the test may consist new items while the same time 
can used measure student progress. 


Test Method Helps, Number Two, has recently been pub- 
lished the World Book Company. This booklet, Ross 
Runnels, “manual for determining the equivalence 
mental ages obtained from group intelligence tests.” The 
equivalent mental-age norms have been worked out for Na- 
tional Intelligence Tests: Scale Form Haggerty Intelli- 
gence Examination, Delta Otis Group Intelligence Scale, 
Advanced, Form Terman Group Test Mental Ability, 
Form and Otis Self-administering Test, Higher, Form 
The equivalent norms will useful comparing the results 
from intelligence tests since significant differences have long 
been noted the mental ages obtained from various tests. 


Objective Measurements English 
ELVA ANNE LYON 


BJECTIVE measurements English have been used, 
during the Autumn Quarter the College Agri- 
culture Ohio State University, one the factors 
the student’s analysis his abilities and aptitudes. The 
experiment further development the policy already 
operation for year several departments that college 
whereby objective measurements are used with the aim 
providing for and encouraging the fullest possible development 
each individual. This necessitates taking into account all 
available factors aptitude and ability and making avail- 
able the student the results the objective measuring 
these factors that may share with the faculty the respon- 
sibility using these known facts developing successful 
college career, which leads, turn, satisfactory participation 
wisely chosen profession. 

This program self-discovery for the Freshmen forms the 
core the one course required all them, the Survey 
Agriculture conducted Mr. Nisonger, the junior dean. 
was his invitation that the English experimentation has been 
carried out this course. Much the credit for whatever 
value the procedure has had must given his interest, sug- 
gestions, and co-operation. 

the College Agriculture, the regularly required Eng- 
lish course does not begin until the second quarter. The stu- 
dents, however, were given chance fall into the habit 
thinking that until the regular English course started English 
was importance. the survey course and elsewhere they 
were impressed with the fact that expressing their thought cor- 
rectly and accurately was essential successful college work. 
Certain the objective pre-tests botany and re- 
quired the interpretation observed facts and the drawing 
inferences from given situations. Difficulty expressing ideas 
clearly was shown definite handicap such tests. Fur- 
thermore, the general attitude instructors that botany, for 
instance, not good botany unless well expressed gives the 
students unmistakable evidence that English practical 
subject that functions all their college work and not depart- 
mentalized routine that operates only during the English class. 
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The felt need for improvement English thus stimulated 
set the stage for the introduction the analytical work early 
the year. But before describing the program was carried 
out, will well define little more clearly the objectives 
the experiment, the data, and the methods. 

The aim was not limited ascertaining what errors Eng- 
lish the present group students made the beginning and 
measuring how much improvement could effected 
given period. was felt that greater service could rendered 
locating the same time, and definitely possible, the 
factors which have limited the students’ facility correct use 
English, isolate far possible the principles involved 
helping the student locate his English difficulties and 
work out satisfactory solution for them, and seek for long- 
term improvements and results rather than for quick, and prob- 
ably temporary, panaceas. The data collected and the methods 
used have been based analysis the problems involved 
the situation and the objectives, both immediate and 
remote, that have been tentatively set for the study. The 
present purpose describe, rather than evaluate, the use 
objective methods defining the problems, suggesting solu- 
tions, and how intended use them measuring results. 

The data used thus far the study have been the following: 

The student’s high-school record, including his standing his 
class, that is, whether the upper, middle, lower third; the 
grades received his high-school English courses; and his atti- 
tude toward his high-school English 

The student’s rating tests taken upon his entrance the Uni- 
versity: psychological test, and reading test 

The Pressey Diagnostic English Test 

written paper prepared for the course, Survey Agriculture 

The student’s statement what feels his chief writing 
difficulties 

The data the student’s high-school record were obtained 

from the forms filled out making application for admission 
the University. attitude toward English was found, 
found all, his response questions the admission blank: 

What studies did you like Give reasons. 

What studies did you spend the most time on? 

What studies did you dislike most, like the 

least? Give reasons. 
What studies did you find most difficult? 
What studies did you work hardest on? 
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This, course, gave the student’s reaction English only 
when occurred one these groups, but surprising 
note, from the results given later, how frequently English 
appeared the answer the third fourth these questions. 
The data the psychological and reading tests were obtained 
from the report these tests furnished the College Agri- 
culture the Department Psychology. Only the case 
the Pressey English tests and the student’s statement his 
“felt writing difficulties” were data used that were not already 
available from routine, administrative sources, part 
regular class assignments the survey course. 


first project upon which the student worked the 
survey course was keeping record his expenditure 
time during his first week college work. form was fur- 
nished which provided spaces for recording the hours spent 
each day the classroom laboratory, the preparation 
lessons, outside activities (which was list), outside 
reading, outside work, and commuting. day two 
before report this project was handed in, the student 
was given this assignment sheet guide his writing: 
SUGGESTIONS FOR THE Discussion Your 

the activity record your first week college you have the basis for 
building efficient working methods which will determine great extent 
your success college student. thorough and critical examination this 
time budget should enable you detect which activities have been allotted 
time and which usurp time that should given things more 
worth while you. Test your program the questions suggested for thought 
which accompanied your record sheet, and others which occur you. 

One the best ways thinking problem through clearly and thor- 
oughly write out that will convey your ideas your reader 
direct and forceful way. You are asked this. Make clear him why 
you are organizing your time the way you are, what there the record 
your first week that you wish continue, and what you will attempt adjust 
meet your needs better. 

Your written discussion will evaluated the following points: 

The thoroughness your analysis your time budget re- 
lates your problems. 

The organization your discussion this analysis. 

Your choice language. 

The conciseness and clearness your expression. 


After this written assignment was handed in, class period 
was used for administering the Pressey English Test. mimeo- 
graphed report the results the test was provided for the 
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students, and the tests themselves were returned and discussed, 
with stress the reasons for the correct forms the specific 
grammatical rules involved, that this information would 
most likely function working tool the student’s 
writing. The report contained statement the mean score 
made the group 164 each the four parts the test, 
that each student could compare his rating with the average 
for the group. The mean score the group for the combined 
scores the four parts the test was 69.3. The mean score 
made last May about nine hundred high-school Seniors 
twelve high schools Ohio was The agriculture students 
were told that, consideration the facilities that were being 
provided assist them with their English difficulties, seemed 
reasonable expect them bring their average the 
end the quarter. The report also contained difficulty anal- 
ysis the items each part the test, compiled tabulating 
the frequency with which each had been missed and then 
arranging the items order from the one most frequently 
missed the one lowest frequency. Also, grouping errors 
according type was made, and methods correction indi- 
cated. The value this procedure was impress the student 
that his English was not hopeless, but that attention few 
major types error would eliminate most his trouble. 
résumé this information given the paragraphs which 
follow for those teachers interested diagnosing the errors 
their students similar method, who wish use the find- 
ings this study suggestive the principles they should 
stress their own classes indicate the errors likely 
found most prevalent the writing their students. 
SENTENCE STRUCTURE— 

Over per cent the total errors consisted failures recognize in- 
complete sentences, the punctuation two more complete sentences 
they were one sentence. 

Thirty-five per cent the errors involved incorrect indefinite ante- 
cedents pronouns. 

Twenty-two per cent the errors consisted incorrect placing words, 
phrases, clauses with relation the word they modified. 

Fourteen per cent the errors grammar consisted violations 


agreement between the subject and predicate person and number, chiefly 
following there is, when subject nouns are joined or, when 
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the verb made agree with noun standing immediately before which 
not the subject. 

Sixteen per cent the errors involved the correct number pronouns 
and verbs following such words each, neither, everybody, none, person. 

Eighteen per cent the errors consisted confusion the nominative 
and objective cases pronouns dependent clauses, and objects verbs 
prepositions such usage girls and between Mary and me. 

Incorrect adverbial form usage constituted per cent the errors. 


Almost per cent the errors capitalization occurred this sen- 
tence, “He lives the West, few miles north our old home”; and 
per cent involved these two sentences, will write Major Cox and 
Professor Bell,” and had been professor, but enlisted and rose the 
rank major.” 

Other items showing high frequency error were the capitalization 
such words high school, company, and club, adjectives other nouns derived 


from proper nouns, capitalization titles books and direct quotations, 
both continuous and broken. 


The highest frequency error involved these usages: The apostrophe 
indicate omitted letters and show possession; the setting off parenthetical 
expressions, and conjunctive adverbs such however and the 
punctuation used with “as follows” introducing enumeration; 
punctuation involved distinguishing between restrictive and non-restrictive 


clauses; and distinguishing between the use the comma and the semicolon 
between the members compound sentence. 


The written assignment was carefully read, the errors indi- 
cated, and rating given according the following standards: 


0.0 


Extremely poor work, even for beginning; the subject incompletely 
treated and badly organized; spelling errors prevalent; incomplete run-on 
sentences; errors tense principal parts verbs, agreement with sub- 


ject person and number; shift pronouns, incorrect antecedent and verb 
relationships with the pronoun. 


fair beginning; subject-matter reasonably complete and organization 
satisfactory for the most part; fairly good command sentence structure; 
some error agreement, pronouns and adverbs. 

2.1 

good beginning; treatment subject-matter adequate and organization 
good; not more than two gross errors fundamentals; sentences must gen- 
eral well constructed and show some variety. 

3.1 

Excellent work for beginning; free from all gross errors; treatment 
subject shows some originality; choice words good; sentences interesting, 
with variety structure, and punctuated for clearness meaning and ease 
reading. ‘The paper presents neat, well-planned, workmanlike appearance. 
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The limited class time permitted almost discussion the 
written assignments after they had been returned, but weekly 
conference period was announced which all students were 
welcome come talk over with the writer the specific diffi- 
culties which the Pressey test and the writing assignment had 
pointed out. The students who had made low rating (one 
which fell approximately the lower quartile either 
both) were especially urged take advantage this oppor- 
tunity, but case was the conference made compulsory. 
little more than half-day required each week take care 
those who take advantage this offer talk over their 
difficulties and develop methods overcoming them. The 
system has not been operation long enough enable the 
drawing conclusions the steadfastness this interest 
the part the student improving his English, test 
definitely the results the method. But quite apart from 
results, seems significant that many students have volun- 
tarily shouldered the responsibility for improvement their 
English when once their weaknesses have been located and the 
way provided for self-help. 


THE present time second report issued the stu- 
dents which incorporated the results the student’s 
statement his “felt difficulties” writing together with the 
investigator’s analysis his difficulties his writing shows 
them, and suggested general methods for overcoming these 
troublesome factors. Before the students had read the critical 
comments their returned papers, they were asked list the 
four five difficulties which bothered them most when they 
were writing, whether not these difficulties showed 
errors the finished product. The students’ “felt difficulties” 
are, order importance according frequency mention: 
punctuation, sentence structure, organizing material and para- 
graphing, spelling, clearness and conciseness expression, lim- 
ited vocabulary, the grammar involved the use pronouns, 
verbs, and adverbs, and legibility and good form the manu- 
script. The analysis the students’ writing was completed be- 
fore the statements felt difficulty were obtained; for that 
reason, well from attempt analyze the category 
errors agree less closely with the Pressey test, the 
categories the two tabulations not coincide sufficiently 
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strictly comparable. This analysis gave frequency error 
this order: sentence structure, including conciseness ex- 
pression; grammar, adverbs, pronouns, verbs, prepositions; 
punctuation; and spelling. 

The class conference discussions some these 
culties the test located them the first report the stu- 
dents treated them are, tentatively considered, sufficient. The 
second report will confined suggested self-help the 
topics spelling; legibility and form the manuscript; 
vocabulary building; development clearness and conciseness 
expression; some further grammatical work pronouns, 
verbs, and adverbs; and the organizing material and devel- 
opment paragraphs. Shortly after this report the 
hands the students another writing assignment for the survey 
course will read and analyzed note the improvement made 
and the errors which still persist, and which will need further 
treatment. the end the quarter objective tests will again 
administered, comparable the ones used the beginning, 
indicate definitely what changes have taken place the 
student’s English equipment. All data accruing from this 
experimental work will available the regular English 
instructors the following quarter use they see fit. 


MUCH for the program objective measurement Eng- 

lish factor the student’s self-analysis and self-help 
with regard the abilities which are closely related college 
success. Further treatment the data available will give some 
idea what the study seems, present, reveal the 
broader sense concerning the English preparation these col- 
lege Freshmen. 

The use objective test English which consists 
testing the student’s ability detect error grammar and 
sentence structure printed form, punctuate and capi- 


talize correctly sentences already written, always raises the 


question the relation success failure such tests 
the student’s ability apply this information his own 
writing. The average length the composition read this 
study was about two hundred words, certainly much too short 
give high degree expectancy finding every error made 
the Pressey test duplicated the written work. The an- 
alysis many thousand words the student’s written work 
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would certainly needed for comparative study that 
type. The present investigation did make attempt com- 
pare the general quality the work the student the test 
and his written work correlating his test score with the 
rating given his composition the four-point scale already 
described which does, course, involve much more the 
subjective element than the test score. The summary which 
follows indicates the correlation coefficient found between these 
two ratings, and between various other ratings that were 
thought sufficient importance warrant this statistical 
treatment: 


Ratings Correlated 

Total score Pressey test correlated with 

Mean high-school English marks 
Composition rating and mean high-school English marks ......... 
Intercorrelation the Pressey test 


The writer suspicious the relatively high correlation ob- 
tained between the Pressey score and the composition rating, 
and until much more experimental evidence hand would 
much prefer attributing “luck” “chance” rather than 
claiming that highly efficient method rating writing ob- 
jectively has been discovered. may noted that there 
decidedly higher correlation between the ratings established 
wholly partially objective measurements used the Uni- 
versity than between the grades received for high-school work, 
which may may not have been arrived objective meth- 
ods, and the ratings received university tests. Only care- 
ful study the student’s subsequent university attainment 
can indicate whether the high-school the university rating 
most accurately prognostic his success the University. 
Correlation coefficients between the parts the Pressey 
test that are only moderately high might taken indicate 
that English ability not general and uniform, but rather 
made specific abilities. student, for instance, may 
reasonably proficient capitalization and punctuation and 
the same time quite lacking the ability write correct 
sentences use the parts speech with due regard gram- 
matical rules. Developing integrated writing ability would 
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seem indicate the necessity locating the specific difficulties 
and attacking them first. 

The rather low mean score which this group Freshmen 
made the Pressey test raised the question whether the 
group was generally low whether English was the exception. 
The mean ranking the group the Intelligence Percentile 
Scale and the Reading Percentile Scale was found 45.9 
and 42.7, respectively. The ranking these students their 
high-school classes was found follows: per cent 
were the upper third their class; per cent, the 
middle third; and per cent, the lower third. These 
figures indicate that low English rating probably cannot 
explained entirely the basis low general scholarship. 

The attitude toward high-school English was next consid- 
ered. the admission blanks, English was listed among the 
studies liked best studied most twenty-four students, 
while 116 listed English their most disliked most difficult 
subject. The reasons for the dislike English most frequently 
given were that was not practical, did not have any con- 
nection with other studies, that they could not understand it, 
that they had difficulty writing, that they could not learn 
strings dates memorize poems. other words, for this 
group students, coming from more than hundred high 
schools, English “did not get across.” The present experiment 
seems indicate, even this early, that possible detect 
objective means some the factors involved this failure, 
and hopes formulate before concluded some methods 
for overcoming these difficulties and test their value ob- 
jective measurements. 


That interest the testing movement has become marked 
colleges and universities shown recent grant 
$500,000 made the General Education Board the Ameri- 
can Council Education. This subvention, which 
expended over ten-year period, will used develop 
achievement tests which will employed for the rating 
students American colleges and universities. 


What High-School Pupils Forget 


RALPH TYLER 
statement frequently made that what little high- 


school students learn quickly forgotten. some 

cases, this claim has been supported test records 
showing that pupils made distinctly lower scores year after 
taking given course than the close the course. con- 
sidering the nature this forgetting, seemed probable 
the writer that information gained course which seldom 
needed thereafter would, general, forgotten most quickly. 
the other hand, explanations which can commonly ap- 
plied the everyday happenings life and think- 
ing which are frequently used might expected retained 
high-school pupils with much less loss. 

investigate this hypothesis experiment was planned 
and conducted through the co-operation the teachers two 
high schools Columbus County, North Carolina. The ninth 
grade each school was composed pupils who had taken 
general science the previous year. Appropriate tests given 
September would indicate what was retained these pupils 
after four months had elapsed since completing the general- 
science course. Since this subject was not taught the ninth 
grade, similar tests given eight months later would show the 
amount forgetting during school year which the pupils 
were not taking science. 

measure the amount information which the pupils 
remember from their general science, the Ruch-Popenoe Gen- 
eral Science Test was used. Form this test was given 
September and repeated eight months later. measure the 
ability explain everyday scientific phenomena and general- 
ize from given scientific data, two new tests were developed. 
The explanation test was multiple-choice test consisting 
fifty items. Each item consisted statement commonly 
observed scientific phenomenon followed five alternative 
explanations, the best one which was checked. 
illustration one the test items is: 


less likely frost cloudy nights late autumn than clear 
nights, because: 
will not frost when the air damp. 
The clouds reduce the radiation heat from the earth. 
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Clouds reduce the amount heat received from the sun. 
always colder clear nights. 
There more moonlight clear nights than cloudy nights. 


This test was given the same time that the Ruch-Popenoe 


test was given and was also repeated eight months later. 


The third test was one requiring generalization from given 
facts. item consisted brief description experi- 
ment. The results obtained from the experiment were given, 
followed five alternative inferences which might drawn 
from these results. The pupils were asked check the in- 
ference which was most reasonable from the facts given and 


the one least likely true. example one the test 
items is: 


piece glass immersed water comes out wet, when immersed 
mercury comes out without any mercury adhering it. When small glass 
tube placed basin water, the water rises the tube above the level 
the water the basin. When the tube placed dish mercury, the 
level the mercury the tube not high the mercury the dish. 

The attraction between particles mercury and glass greater than 
the attraction between mercury particles, hence the mercury pulled down 
the tube. With water and glass the reverse true. 

The attraction between particles water and glass greater than the 
attraction between water particles, hence the water pulled that there 
greater contact with the glass. With mercury the reverse true. 

This example osmosis which causes the water rise and the 
mercury depressed. only takes place small tubes. 

The surface tension water much greater than that mercury 
shown the fact that water wets glass. Hence the surface the water rises 
the tube. With mercury the reverse true. 

The molecules water must much smaller than those mercury 
that they can rise small tubes easily. The large mercury molecules encounter 
difficulty moving the tube. 


This test was also given September and repeated eight 
months later. The reliability the two new tests were .89 
and respectively. Table gives the mean scores made 
the initial tests well those received the final tests. 

probably expected that there was distinct drop 
between the initial and final scores the Ruch-Popenoe test 
which measures largely information. The mean score Sep- 
tember was 27.9 and was only 16.8 May. the other 
hand, there was little difference between the initial and the 
final mean scores the other two tests. seems apparent 
that for these pupils the greatest loss during the year was 
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information about science, whereas there was little loss 
the ability explain everyday phenomena and the ability 
generalize from given facts. 

corresponding results found quite generally, they are 
great significance. There would then justification for 
concluding that the permanent values high-school science 
are not much the information retained the ability ex- 
plain scientific terms and the ability generalize from 


TABLE 


Mean Mean Loss Probable 


Test Initial Eight Error 
Pupils Score Months Mean Loss 
Ruch-Popenoe .......... 27.9 
Generalizations ......... 21.7 1.2 


known facts. this true, the common method deter- 
mining what pupils obtain from high-school education 
giving them range-of-information test incomplete. These 
other abilities less likely fade overnight must also 
plumbed before are justified concluding that high-school 
students quickly lose all that they have learned their four 
years training. They may forget information which little 
used thereafter, but quite possible that they retain those 
methods thinking given field for which there more 
frequent occasion for use. 


The Supervisory Service Bureau the School Education 
the University Wisconsin Madison has recently pub- 
lished the Torgerson Diagnostic Teacher Rating Scale which 
consists series statements describing rather definitely 
ninety practices followed teachers. The supervisor checks 
those statements which she believes apply the teacher being 
rated. The validity the scale has not been determined al- 
though the author states that “every aspect teaching listed 
the scale correlates positively with teaching success.” The 
scale reported highly objective and reliable. 


RALPH TYLER 
and reliable methods for testing the amount 


information which student remembers have been de- 
veloped recently many fields subject-matter. 

Equivalent progress developing other types measurements 
has not been made. Few tests are yet available for measuring 
the degree laboratory skill attained the students, yet 
the natural sciences one objective develop certain labora- 
tory skills which are essential for getting first-hand contact with 
scientific phenomena. 

The necessity for test microscope skill botany and 
classes Ohio State University became apparent when 
was found that some students advanced classes were unable 
use microscope effectively. These students had taken two 
courses which they had been expected use the microscope, 
yet few them were totally unable manipulate the 
instrument. They had been unable these two courses 
make any first-hand observations the minute structures 
which most the life processes plants and animals take 
place. discover these students who are not learning use 
the microscope early, determine the nature their difficul- 
ties, and provide the necessary remedial instruction that 
they may gain the essential proficiency was necessary de- 
vise test skill using the microscope.* Such test should 
administered after the students had been given the usual 
class training using microscope, order identify those 
who had not attained the desired skill. 

The writer made series observations the reactions 
individual students for the purpose determining the types 
difficulties students encounter using the microscope. Thirty- 
three students Botany 402, the second half the elementary 
course, were selected for observation. Using room which was 
unoccupied for the day, two microscopes were placed table 
together with yeast culture, slides, covers, cloth, and lens paper. 
The writer placed himself position which gave him good 
view the actions anyone the table. The instructor, 
Miss Pearle Williams, then called from the laboratory one 


construction the test was suggested Sampson the Department 


Botany and was developed through his help and the co-operation Barrows the 
Department Zodlogy. 
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student When the student came she told him that 
the bottle contained some yeast culture and asked him, 
quickly could, find yeast cell under the microscope 
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and show the cell her when had found it. 


STUDENT'S ACTIONS 


a. Takes slide 
b. Wipes slide with lens paper 
c. Wipes slide with cloth 
d. Wipes slide with finger 
¢. Moves bottle of culture along the table 
f. Places drop or two of culture on slide 
g. Adds more culture 
h. Adds few drops of water 
i. Hunts for cover’ glasses 
j. Wipes cover glass with lens paper 
k. Wipes cover with cloth 
1. Wipes cover with finger 
m. Adjusts cover with finger 
n. Wipes off surplus fluid 
o: Places slide on stage 
p- Looks through eyepiece with: right eye 
q. Looks through eyepiece with left eye 
r. Turns to objective of lowest powcr 
s. Turns to low-power objective 
t. Turns to high-power objective 
u. Holds one eye closed 
v. Looks for light 
w. Adjusts concave mirror 
x. Adjusts plane mirror 
y- Adjusts diaphragm 
z. Does not touch diaphragm 
aa. With eye at eyepiece turns down coarse 
adjustment 
ab. Breaks cover glass 
ac. Breaks slide 
ad. With eye away from eyepiece turns down 
coarse adjustment 
ae. Turns up coarse adjustment a great dis- 
tance 
af. With eye at eyepiece turns down fine ad- 
jostment a great distance 


SKILLS IN WHICH STUDENT NEEDS 
FURTHER TRAINING 


a. In cleaning objective 
b. In cleaning eyepiece 
¢. In focusing low power 
d. In focusing high power 
¢. In adjusting mirror 
f. In using diaphragm 
§ In keeping both eyes open 
In protecting slide and objective from 
breaking by careless focusing 


Fic. Check List Student Reactions Finding Object under the Microscope 


each student began, the writer noted the time and made 
record the sequence actions. When the student finished, 
the time was recorded together with brief characterization 
the quality the student’s mount. From these thirty-three 
observations preliminary list actions, both desirable and 
undesirable, was obtained. This preliminary list was supple- 
mented more than fifty additional observations. portion 


ACTIONS 
ag. With eye away from eyepiece turns down — 
fine adjustment a great distance * Pe =... 
ah. Turns up fine adjustment screw 2 great 
distance 
ai. Turns fine adjustment.serew a few turns. 
aj..Removes slide from stage a. 


ak. Wipes objective with lens paper 
al. Wipes objective with cloth conan 
am. Wipes objective with finger 
an. Wipes eyepiece with lens paper sie 
ao. Wipes eyepiece with cloth 
ap. Wipes eyepiece with finger 
aq. Makes another mount 

ar. Takes another microscope 

as. Finds object 

at. Pauses for an interval | 

au. Asks, “What do you want me to do?” 
av. Asks whether, to use high power 

aw. Says, “I’m satisfied” 
ax. Says that the mount is a!l right for hiscye 

ay. Says he cannot do it AL24. 


az. Told to start new mount Sehecations 
aaa. Directed to find object under low power BD... 


aab. Diftected to find object under high power 
NOTICEABLE CHARACTERISTICS OF 
_ STUDENT'S BEHAVIOR 

2. Awkward in movements 
b. Obviously dexterous in movements Silica 
¢. Slow and deliberate — a 
Very rapid 
Fingers tremble 
f. Obviously perturbed 
g- Obviously angry 
h. Does not take work seriously chains 
i. Unable to work without specific directions ..1/ 
j. Coviously satisfied with his unsuccessful efforts 


CHARACTERIZATION THE 
STUDENT'S MOUNT Actions 

a. Poor light 

b. Poor focus 

c. Excellent mount 

d. Good mount 

e. Fair mount 

f. Poor mount 

g. Very poor mount oon 

h. Nothing in view but a thread in his cyepicce ..... 

i. Something on objective moins 

j. Smeared lens 

k. Unable to find object ples 


the final form the check list shown Figure 


Sequence 
of 
Actions j 
{ 
Sequence 
Actions 
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check list now used requires little writing. The 
arrangement substantially the same when was first 
developed. botany student asked find yeast cell, 
whereas student asked find blood cell. rec- 
ord the kept, but instead writing down his actions 
the check list contains all the actions both desirable and 
undesirable which have been observed thus far. The observer 
records the sequence actions placing the figure after the 
description the student’s first action, figure after his sec- 
ond, and on. The sample reproduced here has been filled 
out for illustration. reading the actions the order they 
are numbered one gets detailed description the student’s 
procedure finding object under the microscope. 

examination the sample will serve indicate the 
usefulness the test. clear that this student very defi- 
cient microscope skill. used eleven minutes without 
finding the cell, whereas the better students find object 
approximately two minutes. does not adjust mirror nor 
diaphragm, closes one eye and turns down the coarse adjust- 
ment, while his eye eyepiece—all habits which must 
remedied before can use microscope skillfully. would 
expected, has poor light the object, has smeared 
objective, and unable find the object. needs training 
each the specific microscope skill. Hence, this 
test record gives the instructor analysis the student’s 
behavior which permits the selection the individual students 
who need training and determination the specific phases 
training which each student needs. 

The most promising part the procedure the discovery 
that students deficient microscope skill can quickly trained 
reasonable proficiency. Miss Blanche Montgomery has 
had charge the remedial training the Department 
She has been giving the test followed remedial 
instruction large number students. She finds that ten 
minutes her time the first day for instruction, followed 
five minutes for check week later, will provide the neces- 
sary guidance for more than half the students. After the 
ten minutes instruction the student spends several periods 
practice, with result that more than per cent the stu- 
dents have become sufficiently skillful use the microscope 
effectively. 
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microscope test is, course, individual test which 

requires certain amount the instructor’s time. Ordi- 
narily, ten twelve students can tested hour, but 
use such individual test with each the seven hundred 
students Botany 401 would require nearly two weeks for 
one instructor. Only the persons who are having difficulty 
using the microscope need observed. means 
easily administered group test those who should take the indi- 
vidual test may selected. the group test, the entire 
section the laboratory seated microscope tables each 
which provided with the necessary materials. The instructor 
informs the class that they are see how rapidly they can find 
object under the microscope. They are all commence 
when gives the signal and stop when gives the second 

The second signal given the end three minutes, 
and the instructor quickly passes from microscope micro- 
scope noting those students who are unable find the object 
three minutes. This same group test given second time 
several days later. Those students who were not able find 
object three minutes both tests are then given the indi- 
vidual test. This group test requires only ten fifteen min- 
utes administer, but means the instructor identifies 
about three-fourths the class who are already proficient with 
the microscope. This means that class forty only ten 
need given the individual test. These ten tests can 
given hour, which not heavy burden any instructor. 

The microscope test not paper-and-pencil test, but 
objective device for determining the degree skill possessed 
students biology. its administration record obtained 
which permits the instructor determine the type remedial 
training needed and give that training with minimum ex- 
penditure time and effort. The student thus trained has 
acquired skill the use essential tool biology. 


EDITORIAL COMMENT 
students education “objective measurement” 


synonymous with paper-and-pencil test, such 

list true-false statements. Where used with such 
restricted connotation objective measurement not para- 
mount importance. 

such restriction should placed upon the term. The 
members the Achievement-Test Division the Bureau 
Educational Research the development objective tests are 
concerned with techniques for obtaining any objective evidence 
the degree which students are attaining the important 
goals education. one purpose course literature 
the development desire read good books, then need 
method for finding out the degree which students are devel- 
oping this desire. purpose hygiene the development 
healthful personal habits, need techniques for determin- 
ing how adequately such habits are being developed among 
school children. other means can adequately evaluate 
our teaching procedures, can diagnose our pupils’ difficulties, 
and can determine the effectiveness remedial training. 

Objective measurement conceived has two major tasks, 
the invention ways measuring all important aspects 
educational development and the elimination unreliable, 
widely varying, subjective estimates student’s attainment. 
invent ways measuring all important aspects educa- 
tional development means that must not restrict ourselves 
paper-and-pencil tests alone—least all some particular 
form paper-and-pencil test. Valid evidence student’s 
attainment first importance whether that demands observa- 
tion the student’s habits, evaluation his oral speech, 
what not. After have invented ways getting valid evi- 
dence, the attempt eliminate subjectivity and reduce the 
cost and difficulty measurement without reducing the validity 
the measures should made. These are the research tasks 
those engaged objective measurement. 

This broader definition objective measurement makes 
clear its fundamental importance. The ultimate value any 
method teaching, scheme administration, any 
educational device can only determined terms measures 
the degree which this method this device facilitates the 
attainment the desired objectives education. 
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READINGS 


“Student Help the Growth the Teacher,” Educational Method, 
(October, 1930), pp. 30-33. 

This teacher sought means evaluating herself teacher. She could 
trust neither her own judgment nor the professional promotion which she had 
received. She wondered how she had measured the standards pupil 
improvement. She asked her students criticize her teacher and person, 
the course study, and the method conducting it. She urged them 
truthful and state the case favorably adversely just they thought. 
They were told not sign their names she did not want know who 
wrote the papers. 

The papers were analyzed for list qualities. ‘Twenty-one qualities 
were recognized and approved the students. qualities with terms 
which the students described the qualities are given the article. She 
also presents list adverse criticisms. 


“An Analysis Pupil Ratings High School 
Teachers,” Educational Administration and Supervision, 
ber, 1930), pp. 440-46. 

The writer this study obtained and analyzed some the factors which 
may enter into pupils’ judgments teachers and compared pupils’ judgments 
with those supervisors and teachers. The pupils were given sheets paper 
and asked rank their teachers during the current semester the following 
questions: (1) “For which teacher you work the hardest?” (2) 
teacher you like the most?” (3) teacher has the best order 
(4) which teacher you learn the most?” (5) “Which 
teacher you think the best most efficient teaching?” The super- 
visors also ranked the teachers for their teaching efficiency. Furthermore, 
each teacher ranked many his associates felt competent appraise 
terms his own training and experience. 

The statistical technique correlation was used determine the relation- 
ship among the rankings the various groups and the rankings the pupils 
the five questions. 


The coefficients seem indicate the pupils’ judgments teaching efficiency are 
based upon factors which supervisors would recognize important. Though the rankings 
not correlate high degree with supervisors’ rankings, there evidence that the 
latter are valid criterion for judging the rankings. may 
possess knowledge their teachers’ efficiency which would valuable supervisors. 


Cureton, Epwarp E., anp Jack “Scoring the Rearrange- 
ment Continuity Test,” School Review, XXXVIII (October, 1930), 
pp. 613-16. 

The continuity test special adaptation the rearrangement test 
applied the chronological order historic events. ‘The writers this 
article believe that the best measure the pupil’s knowledge this type 
material the correlation between his own ranking and the correct ranking. 
Spearman’s rank-difference correlation formula used modified form 
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obtain percentage results. Since such computation for each score would 
laborious, they have worked out nomograph whereby the percentage 
score can determined connecting two points and extending the line 
the percentage scale. The two points connected are the sum the differ- 


ences between the correct order and the pupil’s order squared, and the number 
items the test. 


Ruton, Justin, Scoring Technique for 
Tests Having Multiple Item-Weightings,” Personnel Journal, (October, 
1930), 235-41. 

such test the Strong Vocational Interest Blank, which there are 
420 possible responses evaluated twenty different ways for twenty different 
vocations, the work scoring and tabulating becomes prodigious and expen- 
sive. Hollerith Printing Tabulator machine which alleviates this 
burdensome work. This article gives helpful description how the Holle- 


rith machine may used score weighted responses and tabulate the results 
such complicated tests. 


Journal Educational Research, (October, 1930), pp. 205-10. 

Eight techniques are listed for measuring the degree success pupils 

performing activities, observation, personal interview, group conference, 

diary, question blank, check list, rating scale, and standardized objective tests. 

Since many the results education cannot present quantitatively 

measured, this article makes appeal for the inclusion other than purely 
quantitative techniques courses measurement for teachers training. 


Tabulation and Analysis Sheet for New Type Tests,” 
University High School Journal, (August, 1930), pp. 160-75. 

cross-hatch tabulation technique for recording the responses indi- 
viduals the items test presented. places the entire results 
class test one sheet paper, which enables one analyze easily the items 
the test and the pupils’ responses. may used show relative pupil 
achievement and diagnose pupil and class difficulties. 


Paper Covers 


Presson, Achievement Tests Biology for Secondary School Use 
Based upon Analysis the Content the Subject. Doctor’s disser- 
tation file the Library the University Pennsylvania, 1930. 
149 

The purpose this study the construction standardized achievement 
tests the field secondary biology. The validity the content the 
tests was established careful analysis the subject-matter that taught 
the tenth grade high schools. Separate tests have been devised for 
plant biology, and animal and human biology. The reliability two forms 
each test fairly high. Administering both parts each test requires 
minutes, which means the test can given and scored one class period. 

information test, its validity well established. However, informa- 
tion but one the objectives education; certain attitudes, appreciations, 
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and skills are important and perhaps more than information. There 
need fill the gaps our testing programs with reliable methods 
measurement which will more completely evaluate the outcomes the educa- 
tive process terms all its important objectives. 

Frep 


Educational Tests for Use High Schools, Third Revision. 
Urbana, Illinois: University 1929. pp. (University 
Illinois, Bureau Educational Research, Bulletin No. 53) 

The purpose and administration standardized educational tests for high 
schools are discussed the light their functions, present status, and limita- 
tions. Five criteria, which are valuable selecting tests, are given. list 
references testing which will helpful those engaged teaching 
instituting testing program included. 

Frep 


Results the May Testing Programs. Ann Arbor, 
Michigan: Bureau Educational Reference and Research, School 
Education, University Michigan, 1929. pp. (Bureau Educa- 
tional Reference and Research, Bulletin No. 126) 

Results the May Testing Programs. Ann Arbor, 
Michigan: Bureau Educational Reference and Research, School Edu- 
cation, University Michigan, 1929. pp. (Bureau Educational 
Reference and Research, Bulletin No. 132) 

These reports give the results the testing programs which were 
carried the elementary schools and high schools the state 
Michigan during the years 1928-29 and 1929-30. results the 
programs are presented series tables. The results for both cities 
and population groups are reported terms grade-levels instead 
terms raw scores. Standards are given for the different groups cities, 
for the state whole, and for other comparable groups. author 
believes possible determine the amount growth made grade 
during the year comparison the October and the May scores the 
same group. October tests may serve pre-test determining what 
the students not know and what should emphasized during the year. 


Accomplishment. Pasadena, California: Pasadena 
City Schools, 1929. pp. (Administrative Research Series, 1929, 
Monograph No. Bureau Administrative Research, Pasadena City 
Schools) 

The testing and measurement techniques employed the research staff 
the Pasadena schools evaluating educational accomplishment set forth 
this bulletin. Graphs and tables indicate the holding power the school 
and the degree ageness, over-ageness and under-ageness the pupils. Nine 
factors are considered basis for ability grouping, but practice the group- 
ing largely based intelligence quotients and educational quotients. The 
number exceptional cases not properly classified this way significant 
and points the necessity for more detailed study individual children. 
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Books Read 


Laura Prediction Success Professional Courses for 
Teachers. New York: Teachers College, Columbia University, 1930. 
(Teachers College, Columbia University, Contributions 
Education, No. 420) 

This statistical study which aims aid educational guidance the 
level the graduate school. The book gives review the findings 
previous studies determine the value psychological examinations 
predictive measures. The scores the four predictive tests and the average 
grades academic achievement, the criterion, are converted into T-scores, 
and the reliability each the predictive tests was determined the 
half-test method. The partial correlation method was used develop the 
regression equation for predictive purposes. The research was further con- 
tinued study the discrepancies between academic grades and the predictive 
tests scores. comparative study was made the candidates for the Mas- 
ter’s degree and for the Doctor’s degree, the differences among departments 
Teachers College, sex differences, and age differences. The study 
suggests the inclusion other predictive factors and the success life- 
work rather than academic success the criterion. 


The Construction and Validation Art Test. 
New York: Teachers College, Columbia University, 1929. pp. 
(Teachers College, Columbia University, Contributions Education, 
No. 383) 

This dissertation describes the construction and validation test art 
appreciation. The test includes sixty-one items, each comprising four illus- 
trations art objects. The pupils are asked rank the four illustrations 
each item the order artistic merit. The illustrations have been chosen 
that the total test includes variations line, notan (dark and light), and color. 

The ranking these illustrations which the test scored was deter- 
mined the consensus the rankings one hundred judges who are art 
dealers, critics, buyers, and advanced students. The items finally included 
the test are those which there was substantial agreement among the judges 
the relative order merit. The basis for scoring selected, because had 
the highest reliability, was allow one point for each illustration which was 
ranked the exact position given the judges. 

using this test, two criticisms need considered. the first place, 
called test “art appreciation.” The term “appreciation” has two 
common implications. often used mean enjoyment. this sense, 
test “art appreciation” would measure the student’s enjoyment 
objects possessing artistic merit. This important quality measure, but 
the test does not provide such measure. The term “appreciation” also 
used mean the recognition objects which most nearly meet the accepted 
standards art. this respect the test art-appreciation test, but the 
two meanings the term are not identical. possible for pupil 
recognize painting master one which considered artistic, and still 
the pupil might really prefer see the pictures the Police Gazette. one 
seeking test art enjoyment, this test not suitable. 
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The second criticism the selection the method scoring. 
scoring method was chosen because had the highest reliability. Scoring 
matter validity well reliability. the proper order the illustrations 
pupil ranking them would receive points credit, 
although had recognized the descending value and had placed 
proper relative position all the illustrations except No. the other 
hand, pupil ranking them would receive one point credit, 
although none are proper relative position except No. determine 
the method scoring such test problem validity well reliability; 
hence, the scoring method adopted may criticized. Nevertheless, the test 
will useful providing measure for the development pupils’ judgment 
artistic merit. 


Mann, Type Tests Engineering Education. New 
York: McGraw-Hill Book Company, 1930. 122 pp. 

This book gives account study construction, administration, 
and evaluation series objective type tests designed for use college 
engineering drawing and descriptive geometry. The principal purposes 
constructing the tests have been for the measurement the knowledge 
subject-matter, the power think terms it, and the ability make 
the desired useful application the course materials. ‘Two types tests 
have been constructed, one being the placement examination for the 
determination aptitudes and training for engineering drawing and 
descriptive geometry and the other, the achievement type which purports 
measure classroom accomplishment. 

Frep 


Educational Measurement the Elementary Grades. Yonkers, 
New York: World Book Company, 1930. 294 pp. 

This book, which belongs the Measurement and Adjustment series, 
edited Terman, concerned with tests for the elementary 
grades. written that can used text for most the beginning 
courses educational measurements. brief history measurement 
education and individual differences presented give the background 
out which objective tests have developed. Such procedure tabulation, 
well measures central tendency, measures variability, percentiles, corre- 
lation, and other statistics needed teachers are discussed. Validity and 
reliability tests treated briefly but rather well. The book includes material 
intelligence tests, the assumption that the educational test program 
incomplete without it. 

The chapter achievement tests gives fairly good description 
number typical tests for all the elementary grades. Included are survey 
tests, diagnostic tests, and practice tests. The chapter educational uses 
standardized tests perhaps the most useful chapter the entire book. The 
testing program for the entire school year outlined, emphasizing the 
necessity action after the tests have been given and scored. bibliography 
given the end each chapter. 


